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Thus far, likelihood-based interval estimates for quantiles have not been studied in the literature 
on interval censored case 2 data and partly interval censored data, and, in this context, the use 
of smoothing has not been considered for any type of censored data. This article constructs 
smoothed weighted empirical likelihood ratio confidence intervals (WELRCI) for quantiles in a 
unified framework for various types of censored data, including right censored data, doubly cen- 
sored data, interval censored data and partly interval censored data. The fourth order expansion 
of the weighted empirical log-likelihood ratio is derived and the theoretical coverage accuracy 
equation for the proposed WELRCI is established, which generally guarantees at least 'first 
order' accuracy. In particular, for right censored data, we show that the coverage accuracy is at 
least 0(n~ 1//2 ) and our simulation studies show that in comparison with empirical likelihood- 
based methods, the smoothing used in WELRCI generally provides a shorter confidence interval 
with comparable coverage accuracy. For interval censored data, it is interesting to find that 
with an adjusted rate n -1 ^ 3 , the weighted empirical log-likelihood ratio has an asymptotic dis- 
tribution completely different from that obtained by the empirical likelihood approach and the 
resulting WELRCI perform favorably in the available comparison simulation studies. 

Keywords: bootstrap; doubly censored data; empirical likelihood; interval censored data; partly 
interval censored data; right censored data 

1. Introduction 

Since Owen (1988), the empirical likelihood method has been developed to construct tests 
and confidence sets based on the nonparametric likelihood ratio; see Owen (1990, 1991, 
2001), DiCiccio, Hall and Romano (1991), Qin and Lawless (1994), Mykland (1995) and 
Zhou (2005), among others. Studies have shown that the empirical log-likelihood ratio 
usually has an asymptotic chi-squared distribution and that the empirical likelihood ratio 
inference is of comparable accuracy to alternative methods. In particular, it is shown that 
the empirical likelihood is Bartlett-correctable for smooth function models (DiCiccio, Hall 
and Romano (1991)). 
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In survival analysis, the quantiles of a lifetime distribution are often of significant 
interest. It is known that the likelihood-based methods perform favorably in catching the 
skewness of the distribution of the statistics of interest (Owen (1988), Ren (2001)) and 
Chen and Hall (1993) showed that for the complete sample case, smoothing can improve 
the coverage accuracy of empirical likelihood-based confidence intervals for quantiles. 
But, thus far, likelihood-based interval estimates for quantiles have not been studied in 
literature for interval censored case 2 data and partly interval censored data, and, in this 
context, the use of smoothing has not been considered for any type of censored data. 
This article studies one type of smoothed weighted empirical likelihood-based interval 
estimate for the gth quantile of the lifetime distribution function F$: 

9 Q = F ~ 1 (q), 0<q<l, (1.1) 

with various types of censored data. Throughout this paper, we let X\,...,X n be an 
independently and identically distributed (i.i.d.) random sample from a continuous and 
non- negative distribution function (d.f.) Fq, but we consider the cases when such an 
i.i.d. sample is not completely observable due to censoring. Specifically, in this work, we 
consider the following types of censored data. 



Right censored sample. The observed data are 2 = (Vi,Si), i = 1, . . . , n, with 

v = \Xi, MXi<Yi, 6i = l, , 
1 \Yi, iiX i >Y h 6 t = 0, [ j 

where Y$ is the right censoring variable and is independent of Xi . This type of censoring 
has been extensively studied in the literature in the past few decades. 

Doubly censored sample. The observed data are Oi = (Vi, Si), i = 1, . . . , n, with 



(X u if Zi<Xi< Yi, Si = l, 
V i = iY i , tiXi>Yi, Si = 2, (1.3) 

I Zi, if Xi < Zi, Si = 3, 

where Yi and Zi are right and left censoring variables, respectively, and are independent 
of Xi with P{Zi < Yi} = 1. This type of censoring has been considered by Turnbull 
(1974), Chang and Yang (1987), Gu and Zhang (1993), Ren (1995) and Mykland and 
Ren (1996), among others. One recent example of doubly censored data was encountered 
in a study of primary breast cancer (Ren and Peer (2000)). 



Interval censored sample. 



Case 1. The observed data are Oi = (Yi, Si), i = 1, . . . , n, with 



S i = I{X i <Y i \. 



(1.4) 
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Case 2. The observed data are Oi = (Yi, Zi, Si), i = 1, . . . ,n, with 

rl, ttZt<Xi<Yi, 
Si={2, HX i >Y i , (1.5) 
1 3, ifA 4 <Z,, 

where Yi and Zi are independent of Xi and satisfy P{Zi < Yi] = 1 for Case 2. These two 
types of interval censoring were considered by Groeneboom and Wellner (1992), among 
others. In practice, interval censored Case 2 data were encountered in AIDS research 
(Kim, De Gruttola and Lagakos (1993); also see the discussion in Ren (2003)). 

Partly interval censored sample. 

'Case 1 ' partly interval censored data. The observed data are 

°< = i%'^ v 1 -!??'* (1-6) 

where Si = I{Xi < Y} and Yi is independent of Xj. 

General partly interval censored data. The observed data are 

Q= ( x h ifl<i<»i, (17) 

Ul \(Y,Si), ifm + l<»<n, {L -'> 

where for N potential examination times Y\ < ■ ■ ■ < Yv, letting Yq = and Y/v+i = oo, 
we have Y = (Yi, . . . , Fjv) and ^ = ((5 4 (1) , . . . , s[ N+1) ) with = 1 if Yj-_i < X, < Y, and 
elsewhere. This means that for intervals (0, Y±], (Yi, Yz], ■ ■ ■ , (Y/v, oo), we know which 
one of them Xj falls into. These two types of partial interval censoring were considered 
by Huang (1999), among others. As mentioned in Huang (1999), in practice, the general 
partly interval censored data were encountered in the Framingham Heart Disease Study 
(Odell, Anderson and D'Agostino (1992)) and in the study of the incidence of proteinuria 
in insulin-dependent diabetic patients (Enevoldsen et al. (1987)). 

Obviously, one possible way to construct a likelihood-based confidence interval for 
9q with censored data is to use the likelihood function for a specific censoring model. 
This requires careful investigation for each type of censored sample. Specifically, the 
computation of the confidence region and the asymptotic results on the coverage of 
the confidence region need to be studied for each type of censored data. For works along 
these lines, still called the empirical likelihood approach, see Li, Hollander, McKeague and 
Yang (1996), Chen and Zhou (2003) and Bancrjee and Wellner (2005) for right censored 
data, doubly censored data and interval censored Case 1 data, respectively. However, the 
methods in these works do not have direct extension to other types of censored data, 
such as interval censored Case 2 data (1.5) and partly interval censored data (1.6)— (1.7). 
Also, none of these works contains any coverage accuracy results or considers the use 
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of smoothing. Note that the coverage accuracy of empirical likelihood ratio confidence 
intervals is 0(rt _1 ) for smooth function models (DiCiccio, Hall and Romano (1991)). 
But, with censored data, we no longer have a smooth function model, thus it is quite 
difficult to study the coverage accuracy and it is particularly challenging to carry out 
the type of smoothing in Chen and Hall (1993) using the empirical likelihood approach 
for complicated types of censored data, such as doubly censored data, interval censored 
data and partly interval censored data. 

Instead of undertaking the case-by-case studies for different types of censored data 
using the empirical likelihood approach, Ren (2001) constructs confidence intervals for 
the mean based on a new likelihood function, called a weighted empirical likelihood func- 
tion, which is formulated in a unified form depending only on the probability mass of the 
nonparametric maximum likelihood estimator (NPMLE) F n for Fq. For the mean, the 
•y/n-rate of convergence still holds for the aforementioned censored data (1.2)-(1.7) and 
Ren (2001) considered the first order expansion of the log-likelihood ratio without any 
coverage accuracy results. In this article, we construct smoothed weighted empirical like- 
lihood ratio confidence intervals (WELRCI) for quantile 9q in (1.1), where the -y/n-rate 
of convergence does not hold for interval censored data (1.4)— (1.5) . Here, we derive the 
fourth order expansion of one type of smoothed weighted empirical log-likelihood ratio 
in a unified form for different types of censored data, including all of (1.2)-(1.7). With 
an analytically expressed leading term, this expansion leads to the theoretical coverage 
accuracy equation for the confidence intervals, which generally guarantees at least 1 first 
order^ accuracy. This expansion also leads to the following results. 

(a) When F n has -^/n-rate of convergence, such as in the cases of right censored data, 
doubly censored data and partly interval censored data, the expansion shows that the log- 
likelihood ratio has an asymptotic scaled chi-squared distribution and the leading term 
of the expansion allows the n out of n bootstrap calibration for constructing confidence 
intervals in practice. Our theory shows that smoothed WELRCI is generally consistent; 
in particular, we show that for right censored data, the coverage accuracy of WELRCI 
is at least 0(n -1 / 2 ). Our simulation studies show that in comparison with empirical 
likclihood-bascd methods, the smoothing used in WELRCI generally gives a shorter 
confidence interval with comparable coverage accuracy. 

(b) When F n has n 1 / 3 -rate of convergence, such as in the case of interval censored 
data, the expansion shows that with an adjusted rate of n -1 / 3 , the log-likelihood ra- 
tio has an asymptotic scaled 1? distribution, where n 1 / 3 [F n (6'o) — Fq(6q)] -—ycoZ; see 
(2.4) of Section 2 for details. It is interesting to notice that such a limiting distribution 
of the log-likelihood ratio is completely different from that obtained by the empirical 
likelihood approach for interval censored data (Banerjee and Wcllner (2001, 2005)). In 
other words, the weighted empirical likelihood approach preserves the 'squared' struc- 
ture of the limiting distribution of the log-likelihood ratio for all types of aforementioned 
censored data, while the empirical likelihood approach loses this structure for interval 
censored data. Moreover, for interval censored data, the aforementioned expansion of the 
weighted empirical log-likelihood ratio allows the use of the m out n bootstrap (Bickcl, 
Gotze and van Zwet (1997)) or subsampling (Politis, Romano and Wolf (1999)) cali- 
bration for constructing confidence intervals. With an adaptive choice of the bootstrap 
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sample sizes which is similar to those by Gotze and Rackauskas (2001) or Bickel and 
Sakov (2008), our simulation results for WELRCI using the m out n bootstrap compare 
favorably with those of Banerjee and Wellncr (2005) for interval censored Case 1 data, 
though our method is computationally very time consuming for large n. 

Regarding the main results of this article, an additional two points are worth mentioning. 

(i) When there is no censoring, the bootstrap calibration of empirical likelihood has 
been studied by various authors (see the review in Owen (2001), Sections 3.3 and 3.17) 
and it is shown to have better coverage accuracy than standard chi-squared distribution 
calibration. However, for different types of censored data (1.2)-(1.7), it is not obvious how 
to generally implement the bootstrap calibration described in Owen (2001); for instance, 
it is not obvious how to correctly apply the n out of n bootstrap directly to the weighted 
empirical likelihood ratio, which itself is a solution of an optimization problem. Here, 
the expansion of the weighted empirical log-likelihood ratio provides a natural way to 
generally implement bootstrap calibration for censored data. 

(ii) It is well known that the computation of likelihood-based confidence intervals is 
quite difficult in general. Here, the algorithm for computing WELRCI depends only on 
the NPMLE F n ; that is, the routine itself is the same for different types of censored data. 
Thus, the WELRCI avoids complicated computation problems case- by-case. 

The rest of this paper is organized as follows. Section 2 introduces the weighted empir- 
ical likelihood function with a brief review of the asymptotic properties of the NPMLE 
F„ for Fq. Section 3 constructs one type of smoothed WELRCI for quantile 8q and 
gives related asymptotic results with proofs deferred to Section 6. Section 4 discusses 
the implementation of WELRCI in practice. Section 5 presents some simulation results 
and comparisons between the proposed procedure and alternative methods, and includes 
some concluding remarks. 

2. Weighted empirical likelihood 

In Owen (1988), the empirical likelihood function is given by 



where F is any d.f., and the empirical likelihood ratio function is given by R(F) = 
L(F)/L(F n ) because the empirical d.f. F n of sample X\, . . . , X n is the nonparametric 
maximum likelihood estimator (NPMLE) for Fq; that is, F n maximizes L(F) over all 
distribution functions F. The weighted empirical likelihood function in Ren (2001) is 
given as follows. 

For each type of aforementioned censored data (1.2)-(1.7), the NPMLE F n for Fq 
based on observed data can be expressed as 



n 



L(F) = l[[F(X l )-F(X l -)l 



(2.1) 



2=1 



77 i 




(2.2) 



i=i 
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where W\ < W2 < ••• < W m with pj > 0, 1 < j < m. Specifically, in the case of right 
censored data, Wi's are those non-censored observations in (1.2) and m is the total 
number of non-censored observations among Vi < ■ ■ ■ < V n ) see Kaplan and Meier (1958) 
or Shorack and Wellncr (1986), page 293. In the case of doubly censored data, Wi's 
include all those non-censored observations among V\ <■■■ <V n in (1.3), but certain Wj 
could be a 14 with, say, 5k = 3; see Mykland and Ren (1996). For doubly censored data, 
the NPMLE F n is implicitly, but uniquely, determined by a particular solution of an 
integral equation; in turn, Wi's and pi's are uniquely determined and can be obtained 
through computing F n (Mykland and Ren (1996)). In the case of interval censored Case 2 
data (1.5), the NPMLE F n is also implicitly, but uniquely, determined by a particular 
solution of an integral equation; in turn, Wi's and pi's are uniquely determined and can 
be obtained through computing F n ; see Groeneboom and Wellner (1992) pages 43-46. 
For interval censored Case 2 data, {Wi, . . . , W m } is a subset of {Yi, . . . , Y n , Z\, . . . , Z n } 
in (1.5). Similarly, for interval censored Case 1 data (1.4) and partly interval censored 
data (1.6)— (1.7), Wi's and pi's in (2.2) are also uniquely determined by computing F n ; 
see Groeneboom and Wellner (1992), pages 35-41 and Huang (1999). 
The weighted empirical likelihood function (Ren (2001)) is given by 

m 

w-ntw)- w-)] n ^ ( 2 - 3 ) 

and it is shown (Ren (2008)) that L(F) may be viewed as the asymptotic version of 
the empirical likelihood function L(F) for censored data. It is easy to show that L(F) 
is maximized at F n . Hence, the weighted empirical likelihood ratio is given by R(F) = 
L(F)/L(F n ). One may notice that when there is no censoring, the weighted empirical 
likelihood function (2.3) coincides with Owen's empirical likelihood function (2.1); see 
Ren (2001) for details. 

Remark 1. Asymptotic results for the NPMLE F n . It is known that — Foj| ^>0 
as n—f 00 for right censored data (Stute and Wang (1993)), doubly censored data (Gu 
and Zhang (1993)), interval censored data (Groeneboom and Wellner (1992)), and partly 
interval censored data (Huang (1999)), respectively. It is also known that for right cen- 
sored data, doubly censored data and partly interval censored data, y/n(F n — Fo) weakly 
converges to a centered Gaussian process under certain conditions (Gill (1983), Gu and 
Zhang (1993), Huang (1999)). However, for interval censored Case 1 data (1.4), we have 
that for a fixed point to, 

n 1/3 [F n (to)-F (*o)]^*coZ asrwc», (2.4) 

where Co is a constant and Z = argmin(W / (t) +t 2 ) with W being the two-sided Brownian 
motion starting from (Groeneboom and Wellncr (1992)). For interval censored Case 2 
data (1.5), (2.4) also holds under certain conditions (Wellncr (1995)). Note that (2.4) 
accents for why a -y/n-rate of convergence does not hold for quantile estimators with 
interval censored data. 



Weighted empirical likelihood ratio confidence intervals 



731 



3. Weighted empirical likelihood ratio confidence 
intervals 

In this section, we show that the set S„ = {F -1 (q)\R(F) > c n , F <C F n } may be used as 
confidence interval for the quantilc 6$ given in (1.1), where < c n < 1 is a constant, F 
is a smoothed version of F, as given in equation (3.2) below, and l F -C F n ' means that 
F is absolutely continuous with respect to F n . 

First, note that the NPMLE F n for censored data (1.2)-(1.7) is not always a proper d.f. 
(Mykland and Ren (1996)), but, in this work, we always consider the adjusted version of 
the NPMLE, still denoted F n . Precisely, for the rest of this paper, F n in (2.2) denotes the 
proper d.f. obtained by setting 1 as the value of the NPMLE at the largest observation 
of the data set, that is, setting F n = 1 at V(„) , YL) , max-{Y( n ),Zr n )} or max{JQ's, Yj's}, 
which implies that YliLiPi = 1 in (2.2). This kind of adjustment of the NPMLE is a 
generally adopted convention for censored data (Efron (1967); Miller (1976)). Although 
this F n no longer necessarily maximizes the underlying likelihood function, the usual 
asymptotic properties of the NPMLE needed for this work still hold for this F n ; see the 
later discussion in Remark 2. 

To study the confidence set S n , we let p, = F(Wi) — F(Wi~), 1 < i < m, and let 

r(fl)=Bup|n(Pi/Pi) n * 4 \ Fp\q)=0,F p €d n \, (3.1) 

where ff„ = {F \ F(x) = EtiPiHWj < x}, Pl > 0, E7=iPi = 1}. and for W Q = 0, W = 
(W\, . . • , W m ) and p = (p 1 , . . . ,p m ), F p is a smoothed version of F p by connecting adja- 
cent jump points through straight lines for < x < W m : 



'i-l 



F p (x) =J2HWi-i <x<wM Y,P 3 + ^ ||: ^ ) ^PiHiOV, x) (3.2) 

i=i \j=i 1 1 1 / i=i 

with Hi(W,x) = ^^."^ /{Wi_i < x < Wi} + I{Wi < x}. The proof of the last equation 
in (3.2) is based on straightforward algebra, which is omitted for brevity. In the Appendix, 
we show that S n is an interval satisfying S n = [X^ , Xjj] and 

X L <e <Xu if and only if r(6 ) > c„, (3.3) 

where 



X L = mil F p \q) | F p € ^Ui^r- > 

( m 
Xu = SUpl F p \q) F p e Sn, l[(Pi/Pi) nPi > C n 



(3.4) 
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We call [Xl,Xu] the smoothed weighted empirical likelihood ratio confidence interval 
(WELRCI) for the quantile 9 . Note that (3.3) does not hold if, in (3.1), we use F~ 1 (q) = 9 
in place ofF~ 1 (q) = 9 because F~ 1 (q) = 9 is not equivalent to F p (0) = q. Also, note that 
other types of smoothing, such as the kernel density estimator method (Chen and Hall 
(1993), Ren (2006)), may be considered in (3.1). The smoothing issue will be discussed 
later in Remark 4. 
Since (3.3) implies 

P{X L <9 < Xu} = P{-2logr(9 ) < -21ogc„}, (3.5) 

the asymptotic behavior of [Xl,Xjj] is studied via the weighted empirical log- likelihood 
ratio logr(#o) in Theorem 1 with proofs given in Section 6. In Theorem 1, we let 

$ = FZH<l) and V = F n (0o), (3.6) 
where F n denotes the smoothed version of F n according to (3.2) and we let 

m 

Afc = Afc(#o) and fi k {9) = Y / P l [H t (W,9)-q} k , k = l,2,.... (3.7) 

t=i 

Theorem 1. Assume that for a sequence C n — > oo, we have that as n — > oo, 

C n (f)-q) = O p (l), (AS1) 

f, ^ q, (AS2) 

A 2 ^ q(l - q). (AS3) 

Then: 

(i) with probability 1, we have that for fixed k = 0, 1, 2, 3, 4, 

-21ogr(0 o )=5W +n(f ] -q) k+3 r n ^ |r„, fc | < M r , fc , (3.8) 
all but finitely often, where 1 < M r ^ < oo is a constant and 

B^ = ^^(l + i:^V- q )^ (3-9) 

with Bn^ = n{f] — q) 2 1 fj,2 and 

oi = {2fL 3 )/(3fj,D, a 2 = Aa 4 (A3 ~ 5A2A4), 

a 3 = 2aV 6 (M3 + 5AIA5 - A2A3A4), (3.10) 
a 4 = A2~ 8 (tA3 - 3A2A6 + A2A4 + 2A2A3A5 - 7A2A3A4); 
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(ii) assuming that c n is chosen such that c n = n (C n ) (— 21ogc„) = 0(1) and as- 
suming that 

[C n (fl — <?)] 2 /A2 has a limiting distribution Go, (AS4) 

where Go is continuous with bounded derivative on some finite interval [a,b], which sat- 
isfies [c n — 5, c n + S] C [a, b] for some a > 0, S > and sufficiently large n, then, 

P{X L <9 < X v } = P{A^ < c n } + 0(\\F n . k - Go\\ la .b]) + 0((C„)" (fc+1) ), (3.11) 
where F n _k is the d.f. of =n~ 1 {C n ) 2 B^ and \\ ■ ||uw is the uniform norm on [a,b]. 

In practice, we let pn}a be the (1 — a)100th percentile of in (3.11) for < a < 1 
and then [X^,X^] computed by (3.4) with constant c„ set by 

-21ogc„ = n(C n )- 2 p^ c„ = pW (3.12) 

is called the kth order WELRCI (jfc-WELRCI) for O . Thus, from (3. 1 1)-(3. 12) , we have 
the theoretical coverage accuracy eguation 

P{X<£ ] <9 < X ( v k) } = (l-a) + 0(\\F n , k - Go|| [„,*]) + 0((C n )-^), (3.13) 

where the convergence rate of Hi^n.fe — Co || [«,&] is referred to as the ' first order' . In 
Remark 5 and Section 4, the coverage accuracy issue of A;- WELRCI and the estimation 
of pn}a in practice will be discussed, respectively. 

As mentioned in Section 1, for censored data, we no longer have a smooth function 
model and at present, the coverage accuracy of likelihood-based confidence intervals is 
unknown. With the proof deferred to Section 6, the next corollary shows that (3.8) and 
(3.11) can help us study the coverage accuracy of fc- WELRCI. 

Corollary 1. Under the assumptions of Theorem 1 and the assumption in (2.1) of Chen 
and Lo (1996), the coverage accuracy of WELRCI for the quantile 9o with right censored 
data is at least OlnT 1 / 2 ), that is, with k — in (3.11) and c n set by (3.12), we have 

P{X L 0) <9 < X^} = (1 - a) + 0(n-V 2 ). (3.14) 

Remark 2. Assumptions of Theorem 1. Since F n is an increasing function on the support 
of F n and since 

||-F n - FJ < sup\F n {x) - F n{x - } \ = 0(\\F„ - Foil), (3.15) 

x 

we have q = i*o(#o) = F n (9) and that the asymptotic properties of (?) — q) = [F n (8o) — 
Fo(9q)) are determined by those of [F n (9o) — Fo(9q)]. From Remark 1, we know that 
under suitable conditions, (AS2) holds for all those censored data (1.2)-(1.7). Also, it 
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is easy to show that ||F„ — FoH^-^O implies (AS3). For (AS1), note that whenever 
it is known, C„ is meant to be the convergence rate of f). Thus, for right censored 
data, doubly censored data and partly interval censored data, from Remark 1, we know 
that y/n[F n (8o) — Fo(0o)] has an asymptotic normal distribution; in turn, under suitable 
conditions, (AS1) and (AS4) hold with G„ = y/n. Moreover, for right censored data and 
doubly censored data, we have \F n (8 ) — F n (9 )\ =O a .s.( n_1 ) (R- Cn (1997)), which implies 
that yfn[F n (9 ) — -£0(6*0)] and y/n[F„(8o) — Foi^o)] have the same limiting distribution; 
in turn, Go is the d.f. of poXi> where po is some constant and Xi is a chi-squared random 
variable with one degree of freedom. For interval censored Case 1 or Case 2 data, we 
have (2.4) under certain conditions, and from Groeneboom and Wellner (1992), we could 
expect n^FniWj) - F n (Wj-)] = o p (l) for Wj-i < 9 < Wj, thus (AS1) and (AS4) 
hold with C n = n 1 / 3 when (2.4) holds, where Go is the d.f. of 70Z 2 for some constant 
7o- On the other hand, if smoothing is not used in (3.1), that is, replacing F~ 1 (q) = 6 
by Fp(Q) = q in (3.1), a modified proof of Theorem 1 shows that (3.8)-(3.11) hold with 
V = Fn(@o)i f° r which (AS1) and (AS4) hold with C n = 71 1 / 3 for interval censored data 
whenever (2.4) holds. In our simulation studies presented in Section 5, we denote this 
non-smoothed WELRCI as WELRCIO. Finally, note that (3.8)-(3.9), (AS3)-(AS4), the 
continuity of Go and Polya's theorem imply that ||-Fn,fc — Go||[ a ,6] — ► as n — > 00. Thus, 
if c n is set by (3.12), we have c„ = O(l) in (3.11). 

Remark 3. Limiting distribution of log-likelihood ratio. When F n has y/n-v&te of con- 
vergence, such as in the cases of right censored data, doubly censored data and partly 
interval censored data, from (3.8)-(3.9), |6fc| < 1, (AS3) and Remark 2, we know that 

the weighted empirical log-likelihood ratio — 21ogr(6> ) — *PoXi; where po = 1 when 
there is no censoring. Similarly, when F„ has n 1 / 3 -rate of convergence, such as in 
the case of interval censored data, we know that (3.8)-(3.9), (2.4) and Remark 2 give 

n _1 / 3 [— 2 logr(#o)] -~^7o^ 2 , while the empirical log-likelihood ratio of Banerjee and Well- 
ner (2001), page 1701; (2005), page 411 converges in distribution to D, which is not 
proportional to Z 2 . 

Remark 4- Smoothing and theoretical coverage accuracy of WELRCI. From (6.21) in 
the proof of Theorem l(ii), it is easy to see that if F n ^ satisfies the Lipschitz condition 
\F n ,k{x) — F n ^{y) \ < Mp\x — y\ in the neighborhood of c„ for some constant Mp and 
sufficiently large n, then the term 0(\\F n ,k — Go\\[ a .b]) m equation (3.13) disappears, that 
is, (3.13) becomes the following best possible theoretical coverage accuracy equation: 

P{X ( L k) <9 Q < X { v k) } = (1 - a) + 0((G rl )-( fe+1 >). (3.16) 

Note that the use of F p in (3.1) leads to f) = F n (9o) in (3.6) and it can be shown that 
without smoothing, that is, using F p (9) = q in place of F~ 1 (q) = 9 in (3.1), Theorem 1 has 
f] = F n (9o), for which the Lipschitz condition does not hold. On the other hand, in Ren 
(2006), it is shown that if F n is based on the kernel density method, then the Lipschitz 
condition for F n ^ holds asymptotically when there is no censoring. The implication of 
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this is that it is possible to have (3.16) for censored data with an appropriate smoothing 
of F n . Due to this understanding on the effects of smoothing F n , we consider a very 
simple smoothed version F p of F p in (3.1) and the simulation results compare favorably 
with alternative methods. However, in general, the verification of the Lipschitz condition 
can be quite involved for censored data; this will be studied in a separate paper. Thus, 
for now, we may only view (3.16) as the theoretically best possible for the fc- WELRCI, 
which will be used in the next section to select fc in practice. It should be noted that 
(3.16) suggests that the use of the fourth order expansion in (3.8) is sufficient because 
k = 4 and C n = *Jn give 0(n -5 / 2 ) for the last term in (3.16), while the coverage accuracy 
with Bartlett-correction is only 0(n~ 2 ) for smooth function models (DiCiccio, Hall and 
Romano (1991)). 

Remark 5. Coverage accuracy. The coverage accuracy equation (3.13) is only theoret- 
ical, because the actual coverage accuracy in practice includes the estimation error for 
p n 'a- This means that the actual coverage accuracy can be established via the rate of 
\\Fn,k — Golluw and the estimation error of pii) a . But, if we have (3.16) via an appropri- 
ate smoothing, the coverage accuracy is determined only through the estimation error 
of p^a. In the case of right censored data, our (3.14) in Corollary 1, along with the 
bootstrap estimation error results established in Chen and Lo (1996), ensures the actual 
coverage accuracy of our WELRCI to be at least 0(n -1 / 2 ). For other types of censored 
data (1.3)-(1.7), our equations (3.12)-(3.13) and (3.16) indicate the direction of further 
studies on the actual coverage accuracy and provide guidance on the implementation of 
our smoothed WELRCI in practice, which is discussed in the next section. 

4. Implementation 

Estimation of 

Since the computation of [Xl,Xjj] in (3.4) only depends on the constant c n , (3.12) 
implies that fc-WELRCI [X^\ X)j] for the quantile can be computed if p^} a can be 
estimated consistently for an appropriate fc. 

For right censored data, doubly censored data and partly interval censored data, we 
have C n = y/n and, in turn, A„ = in (3.11) can be expressed as 




(4.1) 




(4.2) 



736 



J.- J. Ren 



for b j (6)=a j (e)/(C n ) :j and fj*=F*(§) > where for fi k {§) given by (3.6)-(3.7), aj(6) are 
calculated by (3.10) with /ifc's replaced by /}&(#) 's and F* is calculated based on the n out 
of n bootstrap method (Efron (1979)). That is, if p n k }a denotes the (1 — o;)100th percentile 

\ (k)* . . (k) . 

of A n , we use it to estimate p n / a in practice. Note that the asymptotic properties of 
(v~ o) = [F n (@o) —Fo(0o)] are the same as those of [F n (6o) —Fo(@o)] due to the smoothing 
method used in (3.2). Thus, from Bickel and Ren (1996) and Huang (1999), the bootstrap 
consistency holds here because it is easy to show that \\F n — Fq\\ -^>0 implies ( —f9$ 
and \fik{0) — /tfc| -'^0; in turn, \&j(9) — &j\ ^~i-0 as n — > oo. 

For interval censored data, if (2.4) holds, we have C„ = n 1 / 3 in (4.1) and the dis- 
tribution of A^f 1 =n~ 1 / 3 Bn may be estimated by that of T n (nl^ 3 (fi nb — q)), where 
f}* fc = F* b (9) is calculated based on the subsampling method (Politis, Romano and Wolf 
(1999), Theorem 2.2.1) or the m out of n bootstrap method (Bickel, Gotze and van 

Zwet (1997)), using rib a s the resampling size. In this CcLSC, pro, a: is estimated by the 

1/3 

(1 — a)100th percentile of f n (n b ' (j)* — qj). With an adaptive choice of the bootstrap 
sample size rife, the m out of n bootstrap performs very well in our simulation studies, 
as shown in Section 5. 

Selection of k 

Note that Corollary 1, along with Remark 5, shows that the coverage accuracy of WEL- 
RCI is at least as good as that achieved by normal-based methods for right censored 
data. But, this may not be the exact coverage accuracy. In fact, when there is no censor- 
ing, the term 0(\\F„^ — Go\\[ a ,b]) (which usually has rate n -1 / 2 ) in (3.13) does not exist 
because the coverage accuracy is 0(n -1 ) for smooth function models (DiCiccio, Hall and 
Romano (1991)). Thus, based on Remark 4, we know that, at best, we may expect the 
theoretical coverage accuracy equation (3.16) to hold for fc-WELRCI, which may be used 

(k) 

to select k in practice. Since the estimation accuracy for p n ' a via the usual bootstrap 
method cannot be better than O p (n _1 ) (Hall (1992)), the criterion we recommend for 
selecting k in practice is to choose the smallest k such that 

(C n )-( fe+1 )<n- 1 , (4.3) 

which implies that n(C n ) _ ^ fc+3 ' ) = oil) in (3.8). Thus, for right censored data, doubly 
censored data and partly interval censored data, we have C n = y/n, for which (4.3) 
implies k = 2; for interval censored Case 1 data (1.4), we have C„ = n 1 / 3 , for which (4.3) 
implies k = 3. 

Computation 

A routine in FORTRAN for computing [X { *\ X^ } ] based on (3.4) is available fr om the 
author, but for the brevity of this article, the theorems on the convergence of this routine 
arc omitted. 
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Table 1. 90% confidence intervals for 0o = f 1 (q) with exponential right censored data 
X ~ Exp(l), Y ~ Exp(3); percentage of 8: 8 = 1: 75.0%; 5 = 0: 25.0% 



n = 50 n = 100 n = 200 



Coverage Average Coverage Average Coverage Average 



9 


& So 


Method 


% 




length (s.d.) 


% 




length (s.d.) 


% 




length (s.d.) 


9 


= 0.250 


1-WELRCI 


90 


1 


0.272 


(0.090) 


89 


9 


0.191 


(0.051) 


89 


8 


0.136 


(0.032) 




= 0.288 


2-WELRCI 


90 


5 


0.273 


(0.090) 


89 


9 


0.192 


(0.051) 


89 


8 


0.136 


(0.032) 






LHMYCI 


90 


1 


0.285 


(0.095) 


90 


3 


0.196 


(0.052) 


90 


2 


0.138 


(0.032) 






SQBPCI 


90 


4 


0.267 


(0.086) 


90 


6 


0.190 


(0.049) 


90 





0.136 


(0.032) 






QBPCI 


89 


4 


0.286 


(0.098) 


90 





0.196 


(0.053) 


89 


5 


0.138 


(0.033) 


9 


= 0.500 


1-WELRCI 


89 


8 


0.480 


(0.151) 


91 





0.344 


(0.091) 


90 





0.242 


(0.057) 


(h 


= 0.693 


2-WELRCI 


89 


4 


0.473 


(0.150) 


90 


7 


0.342 


(0.091) 


90 





0.241 


(0.057) 






LHMYCI 


89 


9 


0.513 


(0.167) 


91 


8 


0.357 


(0.094) 


90 





0.246 


(0.057) 






SQBPCI 


88 


5 


0.467 


(0.147) 


90 


6 


0.341 


(0.089) 


89 


4 


0.241 


(0.056) 






QBPCI 


89 


2 


0.506 


(0.169) 


91 


6 


0.354 


(0.095) 


89 


7 


0.245 


(0.058) 


9 


= 0.750 


1-WELRCI 


88 


6 


0.860 


(0.329) 


88 


4 


0.637 


(0.211) 


90 





0.460 


(0.128) 




= 1.386 


2-WELRCI 


88 


<s 


0.866 


(0.331) 


88 


7 


0.639 


(0.211) 


90 





0.461 


(0.128) 






LHMYCI 


90 


5 


1.057 


(0.512) 


90 





0.706 


(0.251) 


91 


3 


0.484 


(0.136) 






SQBPCI 


87 


2 


0.832 


(0.297) 


88 





0.624 


(0.201) 


89 


4 


0.457 


(0.123) 






QBPCI 


90 


7 


1.041 


(0.499) 


89 


1 


0.684 


(0.243) 


91 


2 


0.476 


(0.136) 



Table 2. 90% confidence intervals for 6o = F x (q) with chi-squared right censored data 
A~x 2 (l), V~Exp(3); percentage of 8: 8=1: 77.5%; 5 = 0: 22.5% 



n = 50 n = 100 n = 200 



Coverage Average Coverage Average Coverage Average 
q & O Method % length (s.d.) % length (s.d.) % length (s.d.) 



q = 0.250 


1-WELRCI 


89 


5 





172 


(0.075) 


89.5 





123 


(0 


044) 


90.0 





087 


(0.026) 


O = 0.102 


2-WELRCI 


89 


8 





173 


(0.076) 


89.7 





124 


(0 


044) 


90.0 





087 


(0.026) 




LHMYCI 


90 


4 





187 


(0.083) 


88.9 





128 


(0 


045) 


90.1 





089 


(0.026) 




SQBPCI 


89 


7 





169 


(0.074) 


89.4 





123 


(0 


043) 


90.1 





087 


(0.026) 




QBPCI 


88 


8 





191 


(0.087) 


89.0 





128 


(0 


047) 


90.0 





090 


(0.027) 


q = 0.500 


1-WELRCI 


89 


8 





500 


(0.200) 


88.7 





359 


(0 


110) 


90.1 





253 


(0.063) 


O = 0.455 


2-WELRCI 


89 


2 





492 


(0.197) 


88.4 





356 


(0 


109) 


89.9 





252 


(0.063) 




LHMYCI 


89 


9 





546 


(0.231) 


88.9 





374 


(0 


117) 


89.4 





259 


(0.064) 




SQBPCI 


88 


4 





484 


(0.193) 


89.0 





354 


(0 


108) 


90.1 





252 


(0.062) 




QBPCI 


89 


1 





542 


(0.230) 


88.8 





371 


(0 


118) 


89.8 





258 


(0.065) 


q = 0.750 


1-WELRCI 


85 


9 


1 


131 


(0.465) 


87.1 





847 


(0 


308) 


88.4 





630 


(0.184) 


O = 1.323 


2-WELRCI 


86 


5 


1 


138 


(0.467) 


87.1 





850 


(0 


308) 


88.4 





631 


(0.184) 




LHMYCI 


85 


5 


1 


438 


(0.871) 


90.0 





963 


(0 


403) 


89.6 





669 


(0.202) 




SQBPCI 


84 


4 


1 


083 


(0.428) 


86.1 





831 


(0 


287) 


88.2 





625 


(0.178) 




QBPCI 


86 


8 


1 


470 


(0.830) 


89.0 





943 


(0 


402) 


89.6 





660 


(0.201) 
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5. Simulation studies and concluding remarks 

Some simulation results for Theorem 1 are presented in this section. Here, letting Exp(/i) 
and x 2 {r) represent the exponential d.f. with mean /i and the chi-squared d.f. with 
degrees of freedom r, respectively, the order fc for smoothed fc-WELRCI is selected based 
on (4.3) and [X^/,X^] is computed by the algorithm mentioned in Section 4. In all of 
our simulation studies here, the EM algorithm is used to compute the NPMLE F n for 
doubly censored data and interval censored data, and the stopping rule used is when the 
uniform distance between two consecutive iterations is less than 0.001. 

Right censored data and doubly censored data 

Since the empirical likelihood-based confidence intervals (C.I.) for quantilcs were con- 
sidered by Li, Hollander, McKcague and Yang (1996) for right censored data (abbrevi- 
ated as LHMYCI), we make comparisons between WELRCI and LHMYCI. Moreover, 
other procedures, such as bootstrap percentile confidence intervals (Efron and Tibshirani 
(1993)) for sample quantilcs F 1 ^ 1 (q) (abbreviated as QBPCI) and for smoothed quan- 
tiles 9 = F~ 1 {q) in (3.6) (abbreviated as SQBPCI), are also considered in our studies. 
In Table 1, 1000 right censored samples (1.2) of size n = 50 are taken from exponential 
distributions and for each sample, 90% fc-WELRCI, LHMYCI, SQBPCI and QBPCI for 
6*o = F a ' 1 (q) with different q are computed, where 400 bootstrap samples of size n = 50 
are used for SQBPCI, QBPCI, and for estimating p£} a in (3.12) to construct smoothed 
fc-WELRCI. The simulation coverage is included in Table 1, and the simulation standard 
deviation (s.d.) of the length of C.I. is given in the parentheses next to the average length 
of C.I. Table 1 also includes the results of the same studies with n = 100 and n = 200, re- 
spectively. The simulation studies in Table 1 arc repeated in Table 2 with Fo = x 2 (l) an d 
repeated in Tables 3 and 4 with exponential and chi-squarcd doubly censored samples 
(1.3), respectively. 

From Tables 1-2, we see that for right censored data, WELRCI and LHMYCI have 
comparable coverage accuracy, but LHMYCI, while not always having better coverage, 
are noticeably wider than WELRCI for moderate sample size, say, n = 50. For right 
censored data and doubly censored data, WELRCI have coverage accuracy similar to 
QBPCI, but are noticeably shorter than QBPCI for moderate sample size n; see Tables 
1-4. 

Interval censored Case 1 data 

In Banerjee and Wellner (2005), simulation results on the empirical likelihood-based 
confidence intervals (abbreviated as BWCI) for the median with interval censored Case 1 
data (1.4) are presented in their Table 4, where X and Y both have Exp(l) distribution. 
Here, we include some of our simulation results on smoothed WELRCI and non-smoothed 
WELRCI0 (see Remark 2) in Table 5 and compare them with BWCI, noting that for 
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Table 3. 90% confidence intervals for 9q = F 1 (q) with exponential doubly censored data 
X ~ Exp(l), Y ~ Exp(3), Z = (2/3)Y - 2.5; percentage of 8: 5 = 1 : 56.0%; 5 = 2: 24.9%; 5 = 3: 
19.1% 



n = 50 n = 100 n = 200 



Coverage Average Coverage Average Coverage Average 



g & 


^0 


Method 


% 




length (s.d.) 


% 




length (s.d.) 


% 




length (s.d.) 


<7 = 


0.250 


1-WELRCI 


89 


5 


0.306 


(0.104) 


90 


3 


0.214 


(0.063) 


89 


9 


0.153 (0.038) 


6»o = 


= 0.288 


2-WELRCI 


89 


8 


0.308 


(0.105) 


90 


6 


0.215 


(0.063) 


89 


9 


0.153 (0.038) 






SQBPCI 


89 


4 


0.298 


(0.102) 


90 


2 


0.212 


(0.062) 


89 


9 


0.152 (0.037) 






QBPCI 


89 


6 


0.324 


(0.116) 


89 





0.221 


(0.068) 


90 


2 


0.155 (0.039) 


<7 = 


0.500 


1-WELRCI 


90 


6 


0.531 


(0.175) 


90 


3 


0.375 


(0.105) 


89 


9 


0.264 (0.063) 


e = 


= 0.693 


2-WELRCI 


90 





0.520 


(0.174) 


89 


7 


0.371 


(0.104) 


89 


8 


0.263 (0.063) 






SQBPCI 


89 


3 


0.514 


(0.169) 


89 


4 


0.369 


(0.103) 


89 


1 


0.263 (0.063) 






QBPCI 


89 


7 


0.557 


(0.195) 


91 


6 


0.389 


(0.112) 


90 


5 


0.269 (0.065) 




0.750 


1-WELRCI 


89 


8 


1.017 


(0.463) 


89 


3 


0.711 


(0.264) 


90 


1 


0.497 (0.144) 




= 1.386 


2-WELRCI 


90 





1.025 


(0.466) 


89 


4 


0.714 


(0.266) 


90 


2 


0.498 (0.144) 






SQBPCI 


88 


7 


0.982 


(0.422) 


87 


6 


0.706 


(0.263) 


88 


7 


0.496 (0.142) 






QBPCI 


90 


3 


1.155 


(0.621) 


89 


6 


0.751 


(0.294) 


91 


4 


0.516 (0.153) 



WELRCI0, high-order expansion is not relevant since smoothing of F n is not used; see 
Remark 4. In Table 5, the simulation results for BWCI are taken directly from Table 4 



Table 4. 90% confidence intervals for do = F 1 (q) with chi-squared doubly censored data 
X ~ X 2 (i), Y ~ Exp(3), Z = (2/3)Y - 2.5; percentage of 5: 5 = 1: 57.2%; 5 = 2: 22.5%; 5 = 3: 
20.3% 











n = 50 






71= 100 




n= 200 








Coveraj 


;e Avera 




Coverage Average 


Coverage Average 


q & 


e 


Method 


% 


length (s.d.) 


% 


length (s.d.) 


% 


length (s.d.) 


<7 = 


0.250 


1-WELRCI 


89.5 


0.195 


(0.093) 


88.8 


0.139 (0.055) 


88.3 


0.098 (0.030) 


e -- 


= 0.102 


2-WELRCI 


90.0 


0.196 


(0.093) 


89.0 


0.139 (0.055) 


88.3 


0.098 (0.030) 






SQBPCI 


89.1 


0.190 


(0.090) 


88.9 


0.136 (0.053) 


88.0 


0.097 (0.029) 






QBPCI 


88.5 


0.218 


(0.104) 


88.2 


0.147 (0.057) 


87.8 


0.100 (0.031) 




0.500 


1-WELRCI 


89.2 


0.543 


(0.224) 


88.4 


0.389 (0.128) 


90.1 


0.278 (0.074) 


e -- 


= 0.455 


2-WELRCI 


88.0 


0.533 


(0.222) 


87.8 


0.386 (0.128) 


89.9 


0.277 (0.074) 






SQBPCI 


87.1 


0.521 


(0.219) 


87.8 


0.381 (0.126) 


88.8 


0.275 (0.073) 






QBPCI 


89.3 


0.586 


(0.249) 


88.1 


0.406 (0.140) 


89.8 


0.284 (0.078) 


q = 


0.750 


1-WELRCI 


85.4 


1.368 


(0.754) 


86.9 


0.928 (0.371) 


89.2 


0.674 (0.208) 




= 1.323 


2-WELRCI 


85.9 


1.378 


(0.754) 


87.1 


0.931 (0.371) 


89.3 


0.675 (0.208) 






SQBPCI 


85.3 


1.325 


(0.672) 


86.9 


0.929 (0.374) 


88.3 


0.672 (0.209) 






QBPCI 


88.2 


1.575 


(0.924) 


88.3 


1.006 (0.425) 


88.8 


0.707 (0.227) 
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of Banerjee and Wellncr (2005). In our simulation studies, c„ in (3.4) is determined 
according to the procedure described in Section 4, where for fc- WELRCI or 1-WELRCI0, 
we use the following adaptive choice of the bootstrap sample size rib for the m out of n 
bootstrap to estimate p^ a ' 

n b = argmin |f{JJ* -fg^J, (5.1) 

\/n<bj <n 7 

where 7 = 0.99, bj = {bj-i + d) with 60 = \fn and Q * is the (1 — a)100th percentile 

of [by 3 (f)l. — q)] 2 / pL2- This selection method for the bootstrap sample sizes is a simple 
version of those in Gotzc and Rackauskas (2001) or Bickcl and Sakov (2008) and our 
experience shows that this selected rib provides much more stable performance than 
simply using, say, rib = \fn in simulation studies. 

From Table 5, we see that for interval censored Case 1 data, smoothed WELRCI 
perform well with the selected bootstrap sample size rib, by (5.1). Compared with BWCI, 
WELRCI have better coverage for moderate sample sizes (also better than WELRCI0), 
while the average length of the confidence intervals is not significantly greater. For large 
sample sizes, WELRCI and WELRCI0 have similar performances and they both have 
slight over-coverage, but the average length of these confidence intervals is quite a bit 
less than the average length of those of BWCI. However, it should be noted that for large 



Table 5. 95% confidence intervals for 60 = F 1 (g) with exponential interval censored data 
Case 1 X ~ Exp(l), Y ~ Exp(l); Percentage of 6: 5= 1: 50.0%; 5 = 0: 50.0% 







q&6o q = 0.25, 


0o = 0.288 


g = 0.50, 


0o = 0.693 


g = 0.75, 


6 = 1.386 






Coverage 


Average 


Coverage 


Average 


Coverage 


Average 


Sample Size 


Method percentag 


;e length 


percentag 


;e length 


percentage length 


n = 


50 


1-WELRCI0 61.4 


0.385 


95.0 


1.162 


81.9 


1.321 


d = 


5 in (5.1) 


3-WELRCI 87.7 


0.710 


95.1 


1.083 


84.5 


1.431 






BWCI 




93.8 


0.962 






n = 


100 


1-WELRCI0 82.0 


0.380 


96.1 


0.936 


94.8 


1.489 


d = 


10 in (5.1) 


3-WELRCI 95.1 


0.535 


97.1 


0.909 


96.3 


1.498 






BWCI 




93.5 


0.788 






n = 


200 


1-WELRCI0 94.0 


0.360 


98.2 


0.665 


98.4 


1.212 


d = 


20 in (5.1) 


3-WELRCI 96.8 


0.401 


97.8 


0.653 


97.4 


1.199 






BWCI 




94.1 


0.626 






n = 


500 


1-WELRCI0 97.7 


0.244 


97.3 


0.391 


97.0 


0.722 


d = 


50 in (5.1) 


3-WELRCI 98.3 


0.252 


97.4 


0.391 


97.3 


0.730 






BWCI 




95.7 


0.470 






n = 


1000 


1-WELRCI0 - 




97.0 


0.258 






d = 


100 in (5.1) 3-WELRCI - 




97.2 


0.259 










BWCI 




95.0 


0.367 
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sample size n, computing WELRCI or WELRCIO is extremely time-consuming due to 
the use of (5.1). 

In regard to selection criterion (4.3) for k, simulation results show that 1-WELRCI and 

2- WELRCI perform similarly for right censored data and doubly censored data, while 

3- WELRCI generally perform noticeably better than 1-WELRCI for interval censored 
data. But, for brevity, Table 5 does not include the simulation results of 1-WELRCI. 

Overall, all simulation results presented here support Theorem 1 in Section 3 and the 
general methodology for implementation described in Section 4. The use of our rather 
simple version of smoothed quantile estimate 9 = F~ 1 (q) of (3.6) in the proposed WEL- 
RCI procedure performs very well in all of our simulation studies. 

Concluding remarks 

We have shown that the proposed smoothed WELRCI provides a consistent likelihood- 
based interval estimate for quantiles with various types of censored data, including some 
of those that have not been previously studied in the literature. Compared with existing 
methods, smoothed WELRCI perform favorably in all available simulation studies. More- 
over, the theoretical coverage accuracy equation (3.13) or (3.16) for smoothed WELRCI 
leads to the actual coverage accuracy result for right censored data and sheds light on 
further studies of coverage accuracy; see Remark 5. Finally, we note that the methods de- 
veloped in this article can easily be used to construct WELRCI for survival probabilities, 
M-statistic, trimmed mean, etc., with different types of censored data. 

6. Proofs 



Proof of Theorem l(i). Since F p in (3.2) is an increasing function on [0, W m ] with 
range [0,1], we have 



where [/,; = [Hi(W, 9q) — q]. Note that from (3.2) and (3.6), we have for rjo = q, 




(6.1) 



III 





and 



(6.2) 



i=l 



where Ui 



[Hi(W,9) — q]. To get an expression for r(Oo) in (6.1), we let 
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then, 



dH (npi \ „ npi 

XnUi —7=0 



dpi \ pi J XnUi + 7 

and, in turn, the constraints Y^LiPi^i = an( i ^JILiPi = S™=iPj = 1 §i ye 



n = ^np i = ^p i (An?7i+7)=7 =* Pi =ft/(l + (6.3) 

i=l i=l 

and A should satisfy 

m m 

fl =VnJL = "V 

1 + At/* 



m m ^ j T 



The desired solutions of (6.4) are in the interval {—U7y — U^) because, in (6.3), we 
require < 1 + XUi, 1 < i < m, and 

r(0o)>c n => U (1) <0<U {m) fL 2 >0. (6.5) 

Since g'(X) < 0, for Ao as the unique solution of g(X) = in the interval {~Ur\,—Uz}), 

m 

logr(0 o ) = -n$^p<log(l + X Q Ui). (6.6) 
i=i 

To study the asymptotic behavior of Ao, we notice that from (6.2), (6.4) and (6.6), we 
have g(Xo) = 0, g(0) = YT=i P»^» = (fl ~ Vo) and 

-(v-vo) = [g(Xo)-g(o)}=g'(t)\o 

m 

= -XoY / P l U?(l+ZU l )- 2 , 

where |£| < |Ao|. Thus, noting that (3.2) implies 

0<Hi(W,x)<l => max \Ui\ = max \Hi(W,9 ) -q\ < 1, (6.7) 

l<i<m l<i<m 

from (1 +£[/ l ) 2 < (1+ |A |) 2 we have 

m m 
i=l i=l 

Since (6.7) implies |Ao| < max{|f/^J, \U7 J} = max{g, 1 — q} = Mi, then from (3.7), 
(AS3) and Theorem 4.2.2 of Chung (1974), we have that with probability 1, 

|A | < - %|(1 + Mi) 2 < P\V - %|(1 + Mi) 2 (6.8) 
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all but finitely often, where p > is a constant. Hence, by (AS2), 

. a.s. n 

Ag — >U as n — > oo. 
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(6.9) 



To avoid overly tedious algebra, the rest of the proof will establish (3.8)-(3.10) for k = 2, 
while the method can easily be used for the case k = 4. 

We let h = g~ l , then .g(Ao) = and 5(0) = (?? — %) imply that h(0) = Ao and h(fj — r]o) = 
0, respectively. Moreover, we have 



h'(fj-Vo) = 
h"(fj-Vo) = 
h"'(fj- Vo ) = 



9'(0) fh' 

g"(Q) _2A 3 
[,9'(0)] 3 Ai ' 

3[g"(o)] 2 -gW"(Q) ^ 6(2A§-A2Ad) 
[ff'(0)] 5 Al 

10 ff 'W(x) 3 '"(x) - [ 5 '(^)] 2 «?W(.t) - 15[ 5 "(z)] 3 



(6.10) 



[</(*)] 7 

where x = /i(y) and, from Taylor's expansion, we have 



A = h(0) = h(fj-T]o) - h'(fj-r]o)(fj-r]o) + ^h"(fj - r) )(fj - ryo) 2 



- - m) 3 + (OW - %) 4 



(6.11) 



^-^o , M3, 



fJ-2 M2 



(fj-vo) +Rh, 
M2 



where fl h = - %) 4 , |£| < \v - %|, satisfying C = h(£) with |C| < |A |. Since by 

(6.7) and (6.9) we have 



/'2 



< 



(1 + IAol) 2 
|5"(C)I=2 



i5'(oi=E 



< 



/'2 



^(l + Ct/0 2 -(l-|Ao| 



E 



PiUi 



|5"'(C)I=6 
l<? (4) (C)l=24 



E 



t(i + Cf/,) 3 



< 



^(i + OA) 4 



< 



E 



i (i + CC/,) 5 



(l-|Ao|) 3: 
6 

24 



(6.12) 



< 



(l-|Ao|) E 
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from (6.9)-(6.10) and (AS3), there exists a constant Mh such that with probability 1, 

\R h \<M h \fj- Vo \ 4 (6.13) 
all but finitely often. Thus, from Taylor's expansion, we have in (6.6) 

m 

- 21ogr(0 o ) = 2nJ2Pi lo g(! + A o^) 
i=i 

m 



-|(A ^) 4 + |(l + Ci)- 5 (Ao^) 5 } 
2n(A (r) - r?o) - ttA 2 ,/^ + | a oj& 3 - j^aM + Ri 



(6.14) 



where [C*| < \W < |A | and R, = §n£™ + Ci)" 5 (A ^i) 5 . Easily, from (6.7)-(6.8), 
we know that with probability 1, 

l-Ril < §n(l - |A |)- 5 |A | 5 < n\f} - m\ 5 M Rl (6.15) 

all but finitely often, where < Mr ± < 00 is a constant. 

By plugging in (6.11) and using tedious algebra to combine the terms with the same 
rate of convergence, we obtain in (6.14) 

-21ogr(6» = <^ 1 + —2 (V - Vo) + —"hprz (V ~ Vo) ^+i?i+i? 2 , 6.16 

I 3/^ 2/4 J 

where i? 2 represents all terms in (6.16) with order n\f]— 770 | J ' , j > 5. Thus, from (6.7) and 
(6.13), we have that there exists a constant Mr 2 such that with probability 1, 

|i? 2 |<n| ? 7-%| 5 Mfl 2 (6.17) 
all but for finitely often. Therefore, (3.8)-(3.10) follow from (6.14)-(6.17). □ 
Proof of Theorem l(ii). From (3.3), (3.8) and (6.5), we have 
P{X L <8 < Xu} 

= P{-21ogr(0 o ) < -21ogc„,/} 2 >0} (6.18) 
= [P{(AW + ( Cn f(fj - Vo ) k+3 r n , k ) < ~c n ] - P{A^ < ~c n }] + P{A™ < £„}, 

where for U = C n (fj - ?7o)/VM2 and lk(U,fj) = ^ 2 Si=i «»( ? 7 _ Vo)\ we have 

AW =n-\C n ) 2 B^ =U 2 +l k (U,fj), fc=l,2,3,4. (6.19) 
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Note that with l (U,f}) = 0, from (6.7), (3.8)-(3.9) and (AS2)-(AS3), we have that with 
probability 1, [U 2 + lk(U,fi)] > ^U 2 all but finitely often. Thus, for f n ,k = (x2T n ,ki from 
(3.8), we know that [U 2 + l k (U,fj)] < {c n + \fj - Vo\ k+1 \r n ,k\U 2 ) implies 

\i] 2 < [u 2 + hiPM < (cn + ft - m\ k+1 \fn, k \u 2 ) < (c„ + \fj - m\ k+1 u 2 M r ,k), 

which, by (AS2), implies that with probability 1, \U 2 < (| - |f) - m\ k+1 M r,k)U 2 < c n 
all but finitely often. Letting Mq be the upper bound of G' Q on [a, b], since U 2 < 4c n 
implies 

\U\<2^ and \C n (r)-r] )\<2^, (6.20) 
(3.11) follows from (6.18)-(6.20), c„ = O p (l), (AS4) and 

\P{(A^ + (C n f{f, - V o) k+3 r n ,k) < ~c n } - P{A^ < c n }\ 
= \P{(U 2 + l k {U, fj) + U 2 {r) - %) fc+1 f„^) < c n } 
-P{[U 2 + l k (U,fi)]<c n }\ 

< P{(c n -\fj- 77o| fe+1 |r„, fc |L> 2 ) < [U 2 + l k (U, f))} 

<(c n + \v-Vo\ k+1 \rn,k\U 2 )} (6.21) 

< P{(c„ - A~c n M r . k \f, - ?7o| fc+1 ) < A« < (5„ + A~c n M r . k \f, - r, Q \ k+1 )} 

< F^k{c n +Ac n M^k{2^/K) k+ \C n )- {k+1) ) 

~ Fn,k(Cn - ±C n M rt k{2^ftn) k+ \C n )- (k+l) ) 

<2||F„ ifc -G || M] + M Go (%c n M rtk {2^) k+1 ){C n )- {k+1) . □ 

Proof of Corollary 1. Note that for right censored data, we have C n = ^/n, and 
that from Ren (1997), Theorem 1, we have \F n (9 ) - F n (6 )\ < \F n (9 a ) - F n (6 -)\ = 
Oa.sX^ 1 ), where O a .s.(l) is bounded with probability 1 for sufficiently large n. Thus, 

4 0) = 4 0) = n{f, Z m)2 =W 2 + O p (n-^ 2 ), (6.22) 

where rj = q, | = F n (6 ), v-i = E™ i Pi{ J i w i < 6 } - ?y ) 2 and W = ^/n{i - r?o)/V^2, be- 
cause 

2 = (1 - 2r? )| + ill = (1 - 2g)F„(0 o ) + q 2 , 

fi 2 =q 2 - 2qff + F n (6 -) + \F n (0 o ) - F n (0 o -)\O(1) = v 2 + Op^ 1 ). 
Via straightforward algebra, we have 

W 2 = n n{i -f - = ^-^o- 2 + W% - , )-^- + W 2 n , (6.23) 
(1 - 2 m )£ + r, 2 m (1 - 2 
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where for <7g,<7 and a as in Chen and Lo (1996), 

Vn(£-Vo) a -2 2 a 6-2 i 

7n = — J— , Ai=<7 -<7 , A 2 = — - 1, 

(1 - a G a G 

_ (l + A 2 )a 2 (?7o|- I+770 -?7o) fl _ Ai - <r 2 A 2 



IJO^ + AO ' ™ cr 2 + Ai • 

From (A. 7) and equation (2.13) of Chen and Lo (1996), we have 

P{|Ai| >n- 1 / 2 (logn)- 1 } = o(n- 1 / 2 ) and P{\ A 2 | > ti~ 2 / 3 } = o^" 1 / 2 ). 

Thus, there exists a constant < M a < oo such that P{|a n | > M^.} = o(n -1 / 2 ) and 
P{|/3„| > Man" 1 / 2 } =o(r7,- 1 / 2 ). Noting that (6.20) implies |t)| < (r?o + 2 v / c^n~ 1/2 ), from 
(6.22) and a similar argument used in (6.21) we have 

|P{(4 0) + n( ? 7 - r) ) 3 r nfi ) < ~c n ] - P{A^ < c n }\ < 2||F 7 - G z \\ [a , b] + O^ 1 / 2 ), 



where for d = cry (1 — t]q)/t}q and Z as the standard normal random variable, P 7 is the 
d.f. of (dj n ) 2 and is the d.f. of (dZ) 2 . Hence, from (6.18), the proof follows by noting 
that Theorem 2 of Chen and Lo (1996) implies ||F 7 - G z \\ [0j6] = 0{ rT 1 ! 2 ). □ 



Appendix 



Proof of (3.3). Since F p in (3.2) is an increasing function on [0, W m ], it is easy to 
show that r(p) = F~ 1 (q) is continuous in p. Also, note that S n can be expressed as S n = 

{r(p)\p G E n }, where J5„ = {p\pi > 0,Y,iLiPi = II™ APi/pi) npi > c n } is a compact and 
convex set in R m . Since r is continuous, from Royden (1968), page 158, we know that S n 
is a compact set in R. Since convexity implies connectivity, from Royden (1968), pages 
152-153, we know that S n is either an interval or a single point. Since S n is compact, we 
know that S n must be a closed interval [Xl, Xjj] with Xl and Xjj given by (3.4). Next, 
we show (3.3) by denoting E = {p\r(p) = 6 ,pi > 0,Y^LiPi = !}■ 

Assume r(6o) > c n . Since r is continuous and {p\pi > 0, Yl^L^Pj = 1} is a compact set, 
we know that Eq is compact and is not empty by (3.1). Thus, (3.1) and (3.4) imply that 
Xl < #0 < Xjj . 

Assume Xl < #o < Xjj . Since r is continuous with Xl and Xjj as the lower and upper 
bound on E n , respectively, we know that from the intermediate value theorem, there 
exists p Q £ E n such that r(p ) = 9q. Hence, (3.1) implies r(6o) > c„. □ 
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